Mixed-Initiative, Entity-Centric Data Aggregation using Assistopedia

Authors

  • Matthew Michelson
  • Sofus A. Macskassy
  • Steven Minton
Abstract

Wikis allow collaborators to collect information about entities. In turn, such entity information can be used for AI tasks, such as information extraction. However, these collaborators are almost exclusively human users. Allowing arbitrary software agents to act as collaborators can greatly enrich a wiki, since agents can contribute structured data to complement the human-contributed, unstructured data. For instance, agents can import huge volumes of structured data about entities, enriching the pages, and agents can update wiki pages to reflect real-time information changes (e.g., win-loss records in sports). This paper describes an approach that allows both arbitrary software agents and human users to collaborate. In particular, we address three key problems: agents updating the correct wiki pages, policies for agent updates, and sharing the schema across collaborators. Using our approach, we describe creating entity-focused wikis, which include the ability to create dynamic categories of entities based on their wiki pages. These categories dynamically update their membership based upon real-world changes.


Similar resources

Provenance in Open Data Entity-Centric Aggregation

An increasing number of web services these days require combining data from several data providers into an aggregated database. Usually this aggregation is based on the linked data approach. On the other hand, the entity-centric model is a promising data model that outperforms the linked data approach because it solves the lack of explicit semantics and the semantic heterogeneity problems. Howe...


Health systems research initiative to tackle growing road traffic injuries in India

Road traffic injuries (RTIs) are the sixth leading cause of death in India, and about 400 deaths take place every day due to road traffic accidents. The present paper analyses data from India’s National Crime Records Bureau (NCRB) to assess the burden of RTI. In addition, it reports the health systems research initiated by the Indian Council of Medical Research (ICMR). As per NCRB data, in...


Towards ECSSE: live Web of Data search and integration

We illustrate the work toward implementing an Entity Centric Semantic Search Engine (ECSSE). ECSSE leverages the Sindice Semantic Web Index to find and combine semantically structured data published on the web. With respect to previous Semantic Web Data integrators, ECSSE uses a holistic approach in which large scale semantic web indexing, logic reasoning, data aggregation heuristic...


Mixed-Initiative Cyber Security: Putting humans in the right loop

Organizations and their computer infrastructures have grown intertwined in complex relationships through mergers, acquisitions, reorganizations, and cooperative service delivery. Consequently, defensive actions and policy changes by one organization may have far-reaching negative consequences on the partner organizations. Human-centric and machine-centric approaches are insufficient for defendi...


D3: Data-centric Data Dissemination in Wireless Sensor Networks

This paper presents a novel method to disseminate sensor data in a wireless sensor network, called D3 (Data-centric Data Dissemination). The method combines the advantages of data-centric routing like SPIN and directed diffusion and energy-efficient MAC protocols such as S-MAC and T-MAC. The protocol’s strengths are its energy-efficiency and its simplicity. Messages are transmitted using broadca...




Publication date: 2010